Data Integration over NoSQL Stores Using Access Path Based Mappings
نویسندگان
چکیده
Due to the large amount of data generated by user interactions on the Web, some companies are currently innovating in the domain of data management by designing their own systems. Many of them are referred to as NoSQL databases, standing for ’Not only SQL’. With their wide adoption will emerge new needs and data integration will certainly be one of them. In this paper, we adapt a framework encountered for the integration of relational data to a broader context where both NoSQL and relational databases can be integrated. One important extension consists in the efficient answering of queries expressed over these data sources. The highly denormalized aspect of NoSQL databases results in varying performance costs for several possible query translations. Thus a data integration targeting NoSQL databases needs to generate an optimized translation for a given query. Our contributions are to propose (i) an access path based mapping solution that takes benefit of the design choices of each data source, (ii) integrate preferences to handle conflicts between sources and (iii) a query language that bridges the gap between the SQL query expressed by the user and the query language of the data sources. We also present a prototype implementation, where the target schema is represented as a set of relations and which enables the integration of two of the most popular NoSQL database models, namely document and a column family stores.
منابع مشابه
OBDA Over Non-Relational Databases
The database landscape has been significantly diversified during the last decade, resulting in the emergence of a variety of non-relational (also called NoSQL) databases, e.g., XML and JSON-document databases, key-value stores, and graph databases. To facilitate access to such databases and to enable data integration of non-relational data sources, we generalize the well-known ontologybased dat...
متن کاملFinding and Fixing Type Mismatches in the Evolution of Object-NoSQL Mappings
NoSQL data stores are popular backends for managing big data that is evolving over time: Due to their schema-flexibility, a new release of the application does not require a full migration of data already persisted in production. Instead, using object-NoSQL mappers, developers can specify lazy data migrations that are executed on-the-fly, when a legacy entity is loaded into the application. Thi...
متن کاملOBDA Beyond Relational DBs: A Study for MongoDB
The database landscape has been significantly diversified during the last decade, resulting in the emergence of a variety of non-relational (also called NoSQL) databases, e.g., XML and JSON-document databases, key-value stores, and graph databases. To facilitate access to such databases and to enable data integration of non-relational data sources, we generalize the well-known ontologybased dat...
متن کاملOntoMongo- Ontology-Based Data Access for NoSQL
Ontology-based data access (OBDA) has gained attention in recent years for providing access to large volumes of data by using ontologies as a conceptual layer and exploring their ability to describe domains and deal with data incompleteness. This is done through mappings that connect the data in the database to the vocabulary of the ontology. The first OBDA studies were about data stored in rel...
متن کاملOn Bridging Relational and Document-Centric Data Stores
Big Data scenarios often involve massive collections of nested data objects, typically referred to as “documents.” The challenges of document management at web scale have stimulated a recent trend towards the development of document-centric “NoSQL” data stores. Many query tasks naturally involve reasoning over data residing across NoSQL and relational “SQL” databases. Having data divided over s...
متن کامل